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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: SAMSON, MICHEL 

PARMENTIER, MARC 
VASSART, GILBERT 
LIBERT, FREDERICK 

(ii) TITLE OF INVENTION: ACTIVE AND INACTIVE CC-CHEMOKINES RECEPTOR 
AND NUCLEIC ACID MOLECULES ENCODING SAID RECEPTOR 

(iii) NUMBER OF SEQUENCES: 17 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Knobbe, Martens, Olson & Bear 

(B) STREET: 620 Newport Center Drive 16th Floor 
n (C) CITY: Newport Beach 

5 (D) STATE: CA 

5 (E) COUNTRY: U.S.A. 

Ui (F) ZIP: 92660 

h1 (v) COMPUTER READABLE FORM: 

^ (A) MEDIUM TYPE: Floppy disk 

"f. (B) COMPUTER: IBM PC compatible 

* (C) OPERATING SYSTEM: PC- DOS/MS -DOS 

J,, (D) SOFTWARE: Patentin Release #1.0, Version #1.25 (EPO) 

tt (vi) CURRENT APPLICATION DATA: 

W (A) APPLICATION NUMBER: 08/833,752 

=K (B) FILING DATE: 9-APR-1997 

O (C) CLASSIFICATION: 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Altman, Daniel E 

(B) REGISTRATION NUMBER: 34,115 

(C) REFERENCE/DOCKET NUMBER: 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 792 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 240.. 791 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GAATTCCCCC AACAGAGCCA AGCTCTCCAT CTAGTGGACA GGGAAGCTAG CAGCAAACCT 60 

TCCCTTCACT ACAAAACTTC ATTGCTTGGC CAAAAAGAGA GTTAATTCAA TGTAGACATC 120 

TATGTAGGCA ATTAAAAACC TATTGATGTA TAAAACAGTT TGCATTCATG GAGGGCAACT 180 

AAATACATTC TAGGACTTTA TAAAAGATCA CTTTTTATTT ATGCACAGGG TGGAACAAG 239 



ATG 


GAT 


TAT 


CAA 


GTG 


TCA 


AGT 


CCA 


ATC 


TAT 


GAC 


ATC 


AAT 


TAT 


TAT 


ACA 


287 


Met 


Asp 


Tyr 


Gin 


Val 


Ser 


Ser 


Pro 


He 


Tyr 


Asp 


He 


Asn 


Tyr 


Tyr 


Thr 




1 








5 










10 










15 






TCG 


GAG 


CCC 


TGC 


CAA 


AAA 


ATC 


AAT 


GTG 


AAG 


CAA 


ATC 


GCA 


GCC 


CGC 


CTC 


335 


Ser 


Glu 


Pro 


Cys 


Gin 


Lys 


He 


Asn 


Val 


Lys 


Gin 


He 


Ala 


Ala 


Arg 


Leu 










20 










25 










30 








CTG 


CCT 


CCG 


CTC 


TAG 


TCA 


CTG 


GTG 


TTC 


ATC 


TTT 


GGT 


TTT 


GTG 


GGC 


AAC 


383 


LeuS2Pro 


Pro 


Leu 


Tyr 


Ser 


Leu 


Val 


Phe 


He 


Phe 


Gly 


Phe 


Val 


Gly 


Asn 





35 40 45 



ATaOCTG GTC ATC CTC ATC CTG ATA AAC TGC AAA AGG CTG AAG AGC ATG 431 
MetlyLeu Val He Leu He Leu He Asn Cys Lys Arg Leu Lys Ser Met 

CO 50 55 60 

ACtS^GAC ATC TAG CTG CTC AAC CTG GCC ATC TCT GAC CTG TTT TTC CTT 479 
ThrgjAsp He Tyr Leu Leu Asn Leu Ala He Ser Asp Leu Phe Phe Leu 
65i7i 70 75 80 

CTT"ACT GTC CCC TTC TGG GCT CAC TAT GCT GCC GCC CAG TGG GAC TTT 527 
Leu?^rhr Val Pro Phe Trp Ala His Tyr Ala Ala Ala Gin Trp Asp Phe 

?S 85 90 . 95 

GGAjtAAT ACA ATG TGT CAA CTC TTG ACA GGG CTC TAT TTT ATA GGC TTC 575 
Gly5^sn Thr Met Cys Gin Leu Leu Thr Gly Leu Tyr Phe He Gly Phe 

X 100 105 110 

TTQ^=¥CT GGA ATC TTC TTC ATC ATC CTC CTG ACA ATC GAT AGG TAG CTG 623 



Phe-'^er 


Gly 


He 


Phe 


Phe 


He 


He 


Leu 


Leu 


Thr 


He 


Asp 


Arg 


Tyr 


Leu 








115 










120 










125 










GCT 


GTC 


GTC 


CAT 


GCT 


GTG 


TTT 


GCT 


TTA 


AAA 


GCC 


AGG 


ACG 


GTC 


ACC 


TTT 


671 


Ala 


Val 


Val 


His 


Ala 


Val 


Phe 


Ala 


Leu 


Lys 


Ala Arg 


Thr 


Val 


Thr 


Phe 






130 










135 










140 












GGG 


GTG 


GTG 


ACA 


AGT 


GTG 


ATC 


ACT 


TGG 


GTG 


GTG 


GCT 


GTG 


TTT 


GCG 


TCT 


719 


Gly 


Val 


Val 


Thr 


Ser 


Val 


He 


Thr 


Trp 


Val 


Val 


Ala 


Val 


Phe 


Ala 


Ser 




145 










150 










155 










160 




CTC 


CCA 


GGA 


ATC 


ATC 


TTT 


ACC 


AGA 


TCT 


CAA 


AAA 


GAA 


GGT 


CTT 


CAT 


TAC 


767 


Leu 


Pro 


Gly 


He 


He 
165 


Phe 


Thr 


Arg 


Ser 


Gin 
170 


Lys 


Glu 


Gly 


Leu 


His 
175 


Tyr 




ACC 


TGC 


AGC 


TCT 


CAT 


TTT 


CCA 


TAC 


A 
















792 


Thr 


Cys 


Ser 


Ser 
180 


His 


Phe 


Pro 


Tyr 





















(2) INFORMATION FOR SEQ ID NO : 2: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1477 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 240.. 1295 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GAATTCCCCC AACAGAGCCA AGCTCTCCAT CTAGTGGACA GGGAAGCTAG CAGCAAACCT 60 

TCCCTTCACT ACAAAACTTC ATTGCTTGGC CAAAAAGAGA GTTAATTCAA TGTAGACATC 120 

TATGTAGGCA ATTAAAAACC TATTGATGTA TAAAACAGTT TGCATTCATG GAGGGCAACT 180 

AJ^TACATTC TAGGACTTTA TAAAAGATCA CTTTTTATTT ATGCACAGGG TGGAACAAG 239 



atS gat 


TAT 


CAA 


GTG 


TCA 


AGT 


CCA 


ATC 


TAT 


GAC 


ATC 


AAT 


TAT 


TAT 


ACA 


287 


MeS Asp 


Tyr 


Gin 


Val 
5 


Ser 


Ser 


Pro 


He 


Tyr 
10 


Asp 


He 


Asn 


Tyr 


Tyr 
15 


Thr 




TCH GAG 


CCC 


TGC 


CAA 


AAA 


ATC 


AAT 


GTG 


AAG 


CAA 


ATC 


GCA 


GCC 


CGC 


CTC 


335 




Pro 


Cys 


Gin 


Lys 


He 


Asn 


Val 


Lys 


Gin 


He 


Ala 


Ala 


Arg 


Leu 








20 










25 










30 






CT|| CCT 


CCG 


CTC 


TAC 


TCA 


CTG 


GTG 


TTC 


ATC 


TTT 


GGT 


TTT 


GTG 


GGC 


AAC 


383 


Leji Pro 


Pro 


Leu 


Tyr 


Ser 


Leu 


Val 


Phe 


He 


Phe 


Gly 


Phe 


Val 


Gly 


Asn 






35 










40 










45 








atJ CTG 


GTC 


ATC 


CTC 


ATC 


CTG 


ATA 


AAC 


TGC 


AAA 


AGG 


CTG 


AAG 


AGC 


ATG 


431 


MeW; Leu 


Val 


He 


Leu 


He 


Leu 


He 


Asn 


Cys 


Lys 


Arg 


Leu 


Lys 


Ser 


Met 














55 










60 










AC j GAC 


ATC 


TAC 


CTG 


CTC 


AAC 


CTG 


GCC 


ATC 


TCT 


GAC 


CTG 


TTT 


TTC 


CTT 


479 


Th^ Asp 


He 


Tyr 


Leu 


Leu 


Asn 


Leu 


Ala 


He 


Ser 


Asp 


Leu 


Phe 


Phe 


Leu 












70 










75 








80 




CTT ACT 


GTC 


CCC 


TTC 


TGG 


GCT 


CAC 


TAT 


GCT 


GCC 


GCC 


CAG 


TGG 


GAC 


TTT 


527 


Leu Thr 


Val 


Pro 


Phe 
85 


Trp 


Ala 


His 


Tyr 


Ala 
90 


Ala 


Ala 


Gin 


Trp 


Asp 
95 


Phe 




GGA AAT 


ACA 


ATG 


TGT 


CAA 


CTC 


TTG 


ACA 


GGG 


CTC 


TAT 


TTT 


ATA 


GGC 


TTC 


575 


Gly Asn 


Thr 


Met 


Cys 


Gin 


Leu 


Leu 


Thr 


Gly 


Leu 


Tyr 


Phe 


He 


Gly 


Phe 








100 










105 










110 






TTC TCT 


GGA 


ATC 


TTC 


TTC 


ATC 


ATC 


CTC 


CTG 


ACA 


ATC 


GAT 


AGG 


TAC 


CTG 


623 


Phe Ser 


Gly 


He 


Phe 


Phe 


He 


He 


Leu 


Leu 


Thr 


He 


Asp 


Arg 


Tyr 


Leu 






115 










120 










125 








GCT GTC 


GTC 


CAT 


GCT 


GTG 


TTT 


GCT 


TTA 


AAA 


GCC 


AGG 


ACG 


GTC 


ACC 


TTT 


671 


Ala Val 


Val 


His 


Ala 


Val 


Phe 


Ala 


Leu 


Lys 


Ala 


Arg 


Thr 


Val 


Thr 


Phe 




130 










135 










140 












GGG GTG 


GTG 


ACA 


ACT 


GTG 


ATC 


ACT 


TGG 


GTG 


GTG 


GCT 


GTG 


TTT 


GCG 


TCT 


719 


Gly Val 


Val 


Thr 


Ser 


Val 


He 


Thr 


Trp 


Val 


Val 


Ala 


Val 


Phe 


Ala 


Ser 




145 








150 








155 










160 




CTC CCA 


GGA 


ATC 


ATC 


TTT 


ACC 


AGA 


TCT 


CAA 


AAA 


GAA 


GGT 


CTT 


CAT 


TAC 


767 


Leu Pro 


Gly 


He 


He 


Phe 


Thr 


Arg 


Ser 


Gin 


Lys 


Glu 


Gly 


Leu 


His 


Tyr 





165 170 175 
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i U i 




rprprp 
ill 


LLA 


1 AC 


AGi 


CAG 


m 7\ m 

1 Ai 


CAA 


i iC 


iGG 


TV TV 

AAG 


'A TV m 

AAT 


815 


nn rc^ 
1 ilir v^y s 


Ser* 


O ^ *V" 

J_ o u 


xlXS 


Ffie 


irro 


Tyr 


Ser 

1 fl c; 

XOD 


GXn 


Tyr 


Gxn 


Fne 


Trp 
1 on 

X J u 


Lys 


TV M-*^ 

Asn 




rpmpi riAr* 

1 i U UAvj 




mrpTv 
1 lA 


A Ar* 

AAG 


AT" A 


GIL 


A TT* 
A J. C 


i iG 


GGG 




Gi C 


CiG 


CCG 


CiG 


Ci i 


863 


irue <aXn 


i LIT 


Leu 


Lys 


X xe 


V ax 


X xe 

9 n n 
z u u 


Leu 


Gxy 


Leu 


vax 


Leu 

one: 

Z U 3 


Pro 


Leu 


Leu 








ATT* 


Tan 




1 V^VJ 




ATT* 
/\1 


L. l/\ 


AAA 


ACT 




i i 


CGG 


iGi 


yii 


vax lyiBu 


V ax 


X xe 


uys 


iyr 


C! £i "v* 


vaxy 


xxe 


Leu 


Lys 


Thr 


Leu 


Leu 


Arg 


Cys 




210 










215 










220 
















A Ar* 


A AP 






Apr* 






AGG 


CTT 


ATT' 


i i 


ArT* 

iiCC 


AiC 


Q C Q 






Lys 


Lys 


Arg 


rllS 


Arg 


Al ^ 

AX a 


vax 


Arg 


Leu 


Tl 

xxe 


Fne 


inr 


Tl » 

xxe 




o 9 c 
Z Z O 








9 n 

Z J u 










235 










Z^ U 




A A T^T* 


rtrnrn 
kjL 1. 




rprprp 
1 i 1 




1 1 v.. 


1 


vjU 1 


/-I /-%/-! 

ccc 


TAC 


AAC 


AT'T' 
fli i 


Gi C 


C i i 


Ci C 


XUU / 


iviec IJ-S 


vax 


Tyr 


FJie 


Leu 


File 


Trp 


A n -V 
AX a 


Pro 


Tyr Asn 


T 1 ^ 

xxe 


vax 


Leu 


Leu 










O A c; 










o n 

Z 3 u 










O c; 
Zoo 






CTG AAC 


ACC 


TTC 


CAG 


TV 7\ 

GAA 


TTC 


TTT 


GGC 


CTG 


AAT 


AAT 


m/~i/^ 

TGC 


T\ /^m 

AGT 


AGC 


m/~irn 

TCT 


1055 


Lieu Asn 


Thr 


riie 

o n 

Z D u 


Gxn 


GXU 


riie 


File 


Gxy 

O c; 
o O 


Leu 


Asn 


Asn 


Cys 


Ser 

o n 


Ser 


Ser 




A A 7\ 


I IG 


GAL. 


T\ 

UAA 


GCi 


AiG 


/-^ TV /-I 

CAG 


GTG 


ACA 


GAG 


ACT 


CTT 


GGG 


TV m/^ 

ATG 


T\ /^/^ 

ACG 


1103 


AsriJ Arg 


Leu 


Asp 


GXn 


T\ 1 —I 

AX a 


Met 


GXn 


VaX 


Tnr 


Glu 


Thr 


Leu 


GXy 


Met 


Thr 






O "7 C 










o o n 
z o U 










o o cr 
285 














AiC 


A A 

AAC 


LLC 


AiC 


Ai C 


lAi 


GCC 


TTT 


GTC 


GGG 


/-i T\ <^ 

GAG 


AAG 


■ till 

TTC 


1151 


Hisy Cys 


Cys 


lie 


Asn 


Pro 


xXe 


lie 


Tyr 


Ala 


Phe 


Val 


Gly 


Glu 


Lys 


Phe 














O Q C 










300 












AGAT^ AAC 


TAC 


CTC 


TTA 


GTC 


mm 1^ 

TTC 


mm /~i 

TTC 


CAA 


AAG 


CAC 


ATT 


GCC 


AAA 


CGC 


TTC 


1199 


Ar^f Asn 


Tyr 


Leu 


Leu 


Val 


Phe 


Phe 


Gin 


Lys 


His 


He 


Ala 


Lys 


Arg 


Phe 




305" 








310 










315 










320 




TG|=j AAA 


TGC 


TOT 


TCT 


ATT 


TTC 


CAG 


CAA 


GAG 


GCT 


CCC 


GAG 


CGA 


GCA 


AGC 


1247 


Cy^; Lys 


Cys 


Cys 


Ser 


He 


Phe 


Gin 


Gin 


Glu 


Ala 


Pro 


Glu 


Arg 


Ala 


Ser 










325 










330 










335 






TCit GTT 


TAC 


ACC 


CGA 


TCC 


ACT 


GGG 


GAG 


CAG 


GAA 


ATA 


TCT 


GTG 


GGC 


TTG 


1295 


Seft Val 


Tyr 


Thr 


Arg 


Ser 


Thr 


Gly 


Glu 


Gin 


Glu 


He 


Ser 


Val 


Gly 


Leu 








340 










345 










350 









TGAtlACGGAC TCAAGTGGGC TGGTGACCCA GTCAGAGTTG TGCACATGGC TTAGTTTTCA 1355 

TACACAGCCT GGGCTGGGGG TNGGTTGGNN GAGGTCTTTT TTAAAAGGAA GTTACTGTTA 1415 

TAGAGGGTCT AAGATTCATC CATTTATTTG GCATCTGTTT AAAGTAGATT AGATCCGAAT 1475 

TC 1477 



{2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1442 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
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(B) LOCATION: 240.. 884 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

GAATTCCCCC AACAGAGCCA AGCTCTCCAT CTAGTGGACA GGGAAGCTAG CAGCAAACCT 60 

TCCCTTCACT ACAAAACTTC ATTGCTTGGC CAAAAAGAGA GTTAATTCAA TGTAGACATC 120 

TATGTAGGCA ATTAAAAACC TATTGATGTA TAAAACAGTT TGCATTCATG GAGGGCAACT ISO 

AAATACATTC TAGGACTTTA TAAAAGATCA CTTTTTATTT ATGCACAGGG TGGAACAAG 239 



ATf^ HAT 


TAT 


PAA 


HTH 


TPA 


AHT 


PPA 


A rnp 
-fix C 


X AX 


P 

CAC 


AX C 


A A T" 
AAX 


TAT 


TAT 


ACA 


287 




lyr 


PI n 


V dx 


Q o-K* 
OCX 


OCX 


xrro 


T 1 £i 

lie 


Tyr 


Asp 


Tl A 

lie 


Asn 


Tyr 


Tyr 


Thr 




X 






c 










1 U 










15 






TCG GAG 




Tnp 




a a 2i 


iil C 


a AT" 
AAX 


CXC 


A A 

AAC 


OA A 

CAA 


AXC 


CCA 


GCC' CGC 


CTC 


335 




irxO 


ret 

v.^ys 


vjin 


Lys 


T1 o 

lie 


Asn 


Val 


Lys 


Gin 


He 


Ala 


Ala 


Arg 


Leu 
















Z D 










30 












i AC 


1L.A 


CIC 


CiG 


TTC 


ATC 


TTT 


GGT 


TTT 


GTG 


GGC 


AAC 


383 


ijeu fTO 


Pro 


Leu 


Tyr 


Ser 


Leu 


val 


Pne 


He 


Phe 


Gly 


Phe 


XT— T 

Val 


Gly 


Asn 




i (^-sp \« 1 ^ 












4 U 








45 














Al U 


CIC 


A Ti A 
AXA 


A A 

AAC 


TGC 


AAA 


AGG 


CTG 


AAG 


AGC 


ATG 


431 


Moir?^"' T on 
iuc l^y JLieU 


vax 


T 1 o 

lie 


Leu 


T1 £i 

lie 


Leu 


Tl ^ 

lie 


Asn 


Cys 


Lys 


Arg 


Leu 


Lys 


Ser 


Met 














55 










60 














ATT' 






CI U 


A A 
AAC 


CiC 


GCC 


ATC 


TCT 


GAC 


CTG 


TTT 


TTC 


CTT 


479 




lie 


Tyr 


Leu 


Leu 


Asn 


Leu 


Ala 


He 


Ser 


Asp 


Leu 


Pile 


Phe 


Leu 












/ u 










75 








80 






VJ i 




i lU 




CCl 


CAC 


TAT 


GCT 


GCC 


GCC 


CAG 


TGG 


GAC 


TTT 


527 




vai 


Pro 


Pne 


Trp 


Ala 


His 


Tyr 


Ala 


Ala 


Ala 


Gin 


Trp 


Asp 


Phe 










O D 




















95 














/*^A A 

CAA 


CTC 


TTG 


ACA 


GGG 


CTC 


TAT 


TTT 


ATA 


GGC 


TTC 


575 




nr* V\ -V* 

J. nr 




cys 


cm 


Leu 


Leu 


Thr 


Gly 


Leu 


Tyr 


Phe 


He 


Gly 


Phe 








1 on 

X u u 










IUd 










110 










lie 


run 
IXC 


ATC 


ATC 


CTC 


CTG 


ACA 


ATC 


GAT 


AGG 


TAG 


CTG 


623 


"D V» i:^?'^' O i=i -V- 


Cjiy 


T 1 £i 

lie 


Pile 


Pne 


He 


He 


Leu 


Leu 


Thr 


He 


Asp 


Arg 


Tyr 


Leu 
















120 










125 






GC't'^GTC 


GTC 


CAT 


GCT 


GTG 


TTT 


GCT 


TTA 


AAA 


GCC 


AGG 


ACG 


GTC 


ACC 


TTT 


671 


Ala Val 


Val 


His 


Ala 


Val 


Phe 


Ala 


Leu 


Lys 


Ala 


Arg 


Thr 


Val 


Thr 


Phe 




130 










135 








140 












GGG GTG 


GTG 


ACA 


AGT 


GTG 


ATC 


ACT 


TGG 


GTG 


GTG 


GCT 


GTG 


TTT 


GCG 


TCT 


719 


Gly Val 


Val 


Thr 


Ser 


Val 


He 


Thr 


Trp 


Val 


Val 


Ala 


Val 


Phe 


Ala 


Ser 




145 








150 








155 










160 




CTC CCA 


GGA 


ATC 


ATC 


TTT 


ACC 


AGA 


TCT 


CAA 


AAA 


GAA 


GGT 


CTT 


CAT 


TAC 


767 


Leu Pro 


Gly 


He 


He 


Phe 


Thr 


Arg 


Ser 


Gin 


Lys 


Glu 


Gly 


Leu 


His 


Tyr 










165 










170 










175 




ACC TGC 


AGC 


TCT 


CAT 


TTT 


CCA 


TAG 


ATT 


AAA 


GAT 


AGT 


CAT 


CTT 


GGG 


GCT 


815 


Thr Cys 


Ser 


Ser 


His 


Phe 


Pro 


Tyr 


He 


Lys 


Asp 


Ser 


His 


Leu 


Gly 


Ala 








180 










185 










190 






GGT CCT 


GCC 


GCT 


GCT 


TGT 


CAT 


GGT 


CAT 


CTG 


CTA 


CTC 


GGG 


AAT 


CCT 


AAA 


863 


Gly Pro 


Ala 


Ala 


Ala 


Cys 


His 


Gly 


His 


Leu 


Leu 


Leu 


Gly 


Asn 


Pro 


Lys 






195 










200 










205 








AAC TCT 


GCT 


TCG 


GTG 


TCG 


AAA 


TGAGAAGAAG AGGCACAGGG CTGTGAGGCT 


914 


Asn Ser 


Ala 


Ser 


Val 


Ser 


Lys 























210 215 
TATCTTCACC ATCATGATTG TTTATTTTCT CTTCTGGGCT CCCTACAACA TTGTCCTTCT 974 
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CCTGAACACC TTCCAGGAAT TCTTTGGCCT GAATAATTGC AGTAGCTCTA ACAGGTTGGA 1034 

CCAAGCTATG CAGGTGACAG AGACTCTTGG GATGACGCAC TGCTGCATCA ACCCCATCAT 1094 

CTATGCCTTT GTCGGGGAGA AGTTCAGAAA CTACCTCTTA GTCTTCTTCC AAAAGCACAT 1154 

TGCCAAACGC TTCTGCAAAT GCTGTTCTAT TTTCCAGCAA GAGGCTCCCG AGCGAGCAAG 1214 

CTCAGTTTAC ACCCGATCCA CTGGGGAGCA GGAAATATCT GTGGGCTTGT GACACGGACT 1274 

CAAGTGGGCT GGTGACCCAG TCAGAGTTGT GCACATGGCT TAGTTTTCAT ACACAGCCTG 1334 

GGCTGGGGGT GGTTGGGAGG TCTTTTTTAA AAGGAAGTTA CTGTTATAGA GGGTCTAAGA 1394 

TTCATCCATT TATTTGGCAT CTGTTTAAAG TAGATTAGAT CCGAATTC 1442 



(2) INFORMATION FOR SEQ ID NO: 4; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 184 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



^^^(ii) MOLECULE TYPE: protein 

^iB{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



MetiyAsp 


Tyr 


Gin 


Val 


Ser 


Ser 


Pro 


He 


Tyr 


Asp 


He 


Asn 


Tyr 


Tyr Thr 


IB 






5 










10 










15 


Ser^^iGlu 


Pro 


Cys 


Gin 


Lys 


He 


Asn 


Val 


Lys 


Gin 


He 


Ala 


Ala 


Arg Leu 






20 










25 










30 


LeuyPro 


Pro 


Leu 


Tyr 


Ser 


Leu 


Val 


Phe 


He 


Phe 


Gly 


Phe 


Val 


Gly Asn 




35 










40 










45 




Met^=^^jLeu 


Val 


He 


Leu 


He 


Leu 


He 


Asn 


Cys 


Lys 


Arg 


Leu 


Lys 


Ser Met 


!•;■"{; 5 0 










55 










60 








Thil==Asp 


He 


Tyr 


Leu 


Leu 


Asn 


Leu 


Ala 


He 


Ser 


Asp 


Leu 


Phe 


Phe Leu 










70 










75 








80 


Leif^Thr 


Val 


Pro 


Phe 


Trp 


Ala 


His 


Tyr 


Ala 


Ala 


Ala 


Gin 


Trp 


Asp Phe 








85 










90 










95 




Thr 


Met 


Cys 


Gin 


Leu 


Leu 


Thr 


Gly 


Leu 


Tyr 


Phe 


He 


Gly Phe 






100 










105 










110 




Phe Ser 


Gly 


He 


Phe 


Phe 


He 


He 


Leu 


Leu 


Thr 


He 


Asp 


Arg 


Tyr Leu 




115 










120 










125 




Ala Val 


Val 


His 


Ala 


Val 


Phe 


Ala 


Leu 


Lys 


Ala 


Arg 


Thr 


Val 


Thr Phe 


130 










135 










140 








Gly Val 


Val 


Thr 


Ser 


Val 


He 


Thr 


Trp 


Val 


Val 


Ala 


Val 


Phe 


Ala Ser 


145 








150 










155 








160 


Leu Pro 


Gly 


He 


He 


Phe 


Thr 


Arg 


Ser 


Gin 


Lys 


Glu 


Gly 


Leu 


His Tyr 








165 










170 










175 


Thr Cys 


Ser 


Ser 


His 


Phe 


Pro 


Tyr 




















180 























(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 352 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Asp Tyr Gin Val Ser Ser Pro He Tyr Asp He Asn Tyr Tyr Thr 

15 10 15 

Ser Glu Pro Cys Gin Lys He Asn Val Lys Gin He Ala Ala Arg Leu 

20 25 30 

Leu Pro Pro Leu Tyr Ser Leu Val Phe He Phe Gly Phe Val Gly Asn 

35 40 45 

Met Leu Val He Leu He Leu He Asn Cys Lys Arg Leu Lys' Ser Met 

50 55 60 

Thr Asp He Tyr Leu Leu Asn Leu Ala He Ser Asp Leu Phe Phe Leu 
S5 70 75 80 

Leu Thr Val Pro Phe Trp Ala His Tyr Ala Ala Ala Gin Trp Asp Phe 

85 90 95 

Gl^ Asn Thr Met Cys Gin Leu Leu Thr Gly Leu Tyr Phe He Gly Phe 
^ 100 105 110 

Ser Gly He Phe Phe He He Leu Leu Thr He Asp Arg Tyr Leu 
W 120 125 

Alffi Val Val His Ala Val Phe Ala Leu Lys Ala Arg Thr Val Thr Phe 

%j 130 135 140 

Gig Val Val Thr Ser Val He Thr Trp Val Val Ala Val Phe Ala Ser 
14S 150 155 160 

Leu- Pro Gly He He Phe Thr Arg Ser Gin Lys Glu Gly Leu His Tyr 

h 1^5 170 175 

Th|S, Cys Ser Ser His Phe Pro Tyr Ser Gin Tyr Gin Phe Trp Lys Asn 

180 185 190 

Ph^ Gln Thr Leu Lys He Val He Leu Gly Leu Val Leu Pro Leu Leu 

S 1^5 200 205 

VapMet Val He Cys Tyr Ser Gly He Leu Lys Thr Leu Leu Arg Cvs 

H 210 215 220 

Arg Asn Glu Lys Lys Arg His Arg Ala Val Arg Leu He Phe Thr He 
225 230 235 240 

Met He Val Tyr Phe Leu Phe Trp Ala Pro Tyr Asn He Val Leu Leu 

245 250 255 

Leu Asn Thr Phe Gin Glu Phe Phe Gly Leu Asn Asn Cys Ser Ser Ser 

260 265 270 

Asn Arg Leu Asp Gin Ala Met Gin Val Thr Glu Thr Leu Gly Met Thr 

275 280 285 

His Cys Cys He Asn Pro He He Tyr Ala Phe Val Gly Glu Lys Phe 

290 295 300 

Arg Asn Tyr Leu Leu Val Phe Phe Gin Lys His He Ala Lys Arg Phe 
305 310 315 320 

Cys Lys Cys Cys Ser He Phe Gin Gin Glu Ala Pro Glu Arg Ala Ser 

325 330 335 

Ser Val Tyr Thr Arg Ser Thr Gly Glu Gin Glu He Ser Val Gly Leu 
340 345 350 
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(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 215 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Met Asp Tyr 


Gin 


Val 


Ser 


Ser 


Pro 


He 


Tyr Asp 


He 


Asn Tyr 


Tyr Thr 


JL 




b 










10 








15 


Ser Glu Pro 


Cys 


Gin 


Lys 


He 


Asn 


Val 


Lys 


Gin 


He 


Ala Ala 


Arg Leu 




20 










25 








30 


Leu Pro Pro 


Leu 


Tyr 


Ser 


Leu 


Val 


Phe 


He 


Phe 


Gly Phe Val 


Gly Asn 


35 










40 










45 


Met Leu Val 


He 


Leu 


He 


Leu 


He 


Asn 


Cys 


Lys 


Arg 


Leu Lys 


Ser Met 


P 50 








55 










60 






ThiS Asp He 


Tyr 


Leu 


Leu 


Asn 


Leu 


Ala 


He 


Ser Asp 


Leu Phe 


Phe Leu 


6£ 






70 










75 






80 


LeUJ Thr Val 


Pro 


Phe 


Trp 


Ala 


His 


Tyr 


Ala 


Ala 


Ala 


Gin Trp 


Asp Phe 






85 










90 








95 


Gl^ Asn Thr 


Met 


Cys 


Gin 


Leu 


Leu 


Thr 


Gly Leu 


Tyr 


Phe He 


Gly Phe 




ICQ 










105 








110 




PhSi Ser Gly 


He 


Phe 


Phe 


He 


He 


Leu 


Leu 


Thr 


He 


Asp Arg 


Tyr Leu 


r 115 










120 










125 




Aim Val Val 


His 


Ala 


Val 


Phe 


Ala 


Leu 


Lys 


Ala 


Arg 


Thr Val 


Thr Phe 


S 130 








135 








140 






Glf^Val Val 


Thr 


Ser 


Val 


He 


Thr 


Trp 


Val 


Val 


Ala 


Val Phe 


Ala Ser 








150 










155 






160 


Left; Pro Gly 


He 


He 


Phe 


Thr 


Arg 


Ser 


Gin 


Lys 


Glu Gly Leu 


His Tyr 






165 










170 








175 


Thit Cys Ser 


Ser 


His 


Phe 


Pro 


Tyr 


He 


Lys 


Asp 


Ser 


His Leu 


Gly Ala 




180 










185 








190 


Gly Pro Ala 


Ala 


Ala 


Cys 


His 


Gly 


His 


Leu 


Leu 


Leu 


Gly Asn 


Pro Lys 


195 










200 










205 


Asn Ser Ala 


Ser 


Val 


Ser 


Lys 
















210 








215 

















(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 360 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 
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Leu 


iber 


j.nr 


oer 


Arg 


oer 


Arg 


T>V4 ^ 

Pile 


He 


Arg Asn 


Thr 


Asn 


Glu 


Ser 


1 








5 










10 










15 




Gly 


Glu 


Glu 


Val 


Thr 


Thr 


Phe 


Phe 


Asp 


Tyr Asp 


Tyr 


Gly Ala 


Pro 


Cys 








20 










25 










30 






JrlXS 


Lys 


irne 

35 


Asp 


vai 


Lys 




Tl ^ 

lie 

A A 

40 


Caiy 


Ala 


Gin 


Leu 


Leu 
45 


Pro 


Pro 


Leu 


iyr 


OCX 


J-jfc: U. 


V ciX 




X X ti 


JrXiti 


vjxy 


Jrne 


Val 


Gly Asn 


Met 


Leu 


vai 


vai 




50 










55 










60 










Leu 


lie 


Leu 


lie 


Asn 


cys 


Lys 


Lys 


Leu 


Lys 


Cys 


Leu 


Thr Asp 


T 1 

lie 


Tyr 


65 










70 










75 










80 


Leu 


Leu 


Asn 


Leu 


Aia 


Tl ^ 

lie 


Ser 


Asp 


Leu 


Leu 
90 


Phe 


He 


He 


Thr 


Leu 

A C 

95 


Pro 


Leu 


Trp 


Ala 


His 


Ser 


TV T — 1 

Ala 


TV 1 -1 

Ala 


Asn 


Glu 


Trp 


Val 


Phe 


Gly Asn 


TV T 

Ala 


Met 








1 A A 

100 










105 










110 






V— y S 


Lys 


Leu 


xrlie 


i niT 


vjiy 


Leu 


Tyr 


rllS 


He 


Gly Tyr 


Phe 


Gly 


Cjiy 


lie 






115 










120 










125 








File 


File 


lie 


lie 


Leu 


Leu 


Thr 


Tl « 

lie 


Asp 


Arg 


Tyr 


Leu 


Ala 


He 


Val 


TT J — 

His 




130 










135 






140 










■A Is -4 


val 


Pile 


Ala 


Leu 


Lys 


Ala 


Arg 


Thr 


Val 


Thr 


Phe 


Gly Val 


Val 


Thr 












150 










155 










160 




Val 


He 


Thr 


Trp 


Leu 


Val 


Ala 


Val 


Phe 


Ala 


Ser 


Val 


Pro 


Gly 


He 










165 










170 










175 






Pile 


Thr 


Lys 


Cys 


Gin 


Lys 


Glu 


Asp 


Ser 


Val 


Tyr 


Val 


Cys 


Gly 


Pro 








^ O A 

180 










185 










190 






Tils 


Pile 


Pro 


Arg 


Gly 


Trp 


Asn 


Asn 


Phe 


His 


Thr 


He 


Met 


Arg 


Asn 


He 






195 










200 










205 








Lq^u 


(aiy 


Leu 


val 


Leu 


Pro 


Leu 


Leu 


He 


Met 


Val 


He 


Cys 


Tyr 


Ser 


Gly 




210 










215 










220 








T T 

z 4 p. 


Leu 


Lys 


Thr 


Leu 


Leu 

AAA 

230 


Arg 


Cys 


Arg 


Asn 


Glu 
235 


Lys 


Lys 


Arg 


His 


Arg 
240 




V ai 


Arg" 


vai 


lie 


Fne 


Thr 


lie 


Met: 


He 


Val 


Tyr 


Phe 


Leu 


Pile 


Trp 










A /I C" 

245 










250 










255 


Pro 


Tyr 


Asn 


He 


Val 


He 


Leu 


Leu 


Asn 


Thr 


Phe 


Gin 


Glu 


Phe 


Phe 








260 










265 










270 






Gly 


Leu 


Ser 


Asn 


Cys 


Glu 


Ser 


Thr 


Ser 


Gin 


Leu Asp 


Gin 


Ala 


He 


Gin 






275 










280 










285 








Val 


Thr 


Glu 


Thr 


Leu 


Gly 


Met 


Thr 


His 


Cys 


Cys 


He 


Asn 


Pro 


He 


He 




290 










295 






300 










Tyr 


Ala 


Phe 


Val 


Gly 


Glu 


Lys 


Phe 


Arg 


Arg 


Tyr 


He 


Ser 


Val 


Phe 


Phe 


305 










310 










315 










320 


Arg 


Lys 


His 


He 


Xaa 
325 


Xaa 


Xaa 


Phe 


Cys 


Lys 
330 


Gin 


Cys 


Pro 


Val 


Phe 
335 


Tyr 


Arg 


Glu 


Thr 


Val 
340 


Asp 


Gly 


Val 


Thr 


Ser 
345 


Thr 


Asn 


Thr 


Pro 


Ser 
350 


Thr 


Gly 


Glu 


Gin 


Glu 
355 


Val 


Ser 


Ala 


Gly 


Leu 
360 



















(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 55 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



Met 


Thr 


Thr 


Ser 


He 


Asp 


Thr 


Val 


Glu 


Thr 


Pne 


Gly 


Thr 


Thr 


Ser 


Tyr 


1 








D 










XU 










iD 




Tyr 


Asp 


Asp 


Val 


Gly 


Leu 


Leu 


Cys 


Glu 


Lys 


Ala 


Asp 


Thr 


Arg 


Ala 


Leu 








20 










25 
















Met 


Ala 


Gin 


Phe 


Val 


Pro 


Pro 


Leu 


Tyr 


Ser 


Leu 


Val 


Pne 


Thr 


Val 


Gly 






35 










40 










45 








Leu 


lie 


Gly 


Asn 


Val 


Val 


Val 


Val 


Met 


He 


Leu 


He 


Lys 


Tyr 


Arg 


Arg 




50 










55 










60 










lie 


Arg 


lie 


Met 


Thr 


Asn 


He 


Tyr 


Leu 


Leu 


Asn 


Leu 


Ala 


He 


Ser 


Asp 


65 










70 










75 










80 


Leu 


Leu 


Phe 


He 


Val 


Thr 


Leu 


Pro 


Phe 


Trp 


Thr 


His 


Tyr 


Val 


Arg 


Gly 










85 










90 










95 






Asn 


Trp 


Val 


Phe 


Gly 


His 


Gly 


Met 


Cys 


Asn 


Leu 


He 


Ser 


Gly 


Phe 








100 










105 










110 








His 


Thr 


Gly 


Leu 


Tyr 


Ser 


Glu 


He 


Phe 


Phe 


He 


He 


Leu 


Leu 


Thr 






115 










120 










125 










Asp 


Arg 


Tyr 


Leu 


Ala 


He 


Val 


His 


Ala 


Val 


Phe 


Ala 


He 


Arg 


Ala 




130 










135 










140 












Thr 


Val 


Thr 


Phe 


Gly 


Val 


He 


Thr 


Ser 


He 


Val 


Thr 


Trp 


Gly 


He 












150 










155 










160 




Val 


lie 


Ala 


Ala 


Leu 


Pro 


Glu 


Phe 


He 


Phe 


Tyr 


Glu 


Thr 


Glu 


Glu 


CO 








165 










170 










175 




Leu 


Phe 


Glu 


Glu 


Thr 


He 


Cys 


Ser 


Ala 


Leu 


Tyr 


Pro 


Glu 


Asp 


Thr 


Val 








180 










185 










190 








Ser 


Trp 


Arg 


His 


Phe 


His 


Thr 


He 


Arg 


Met 


ml 

Thr 


He 


Phe 


Cys 


Leu 






195 










200 










205 








V&l 


Leu 


Pro 


Leu 


Leu 


Val 


Met 


Ala 


He 


Cys 


Tyr 


Thr 


Gly 


He 


He 


Lys 




210 










215 










220 










Thr 


Leu 


Leu 


Arg 


Cys 


Pro 


Xaa 


Xaa 


Xaa 


Lys 


Tyr 


Lys 


Ala 


He 


Arg 


Leu 


225 










230 










235 










240 


He 


Phe 


Val 


He 


Met 


Ala 


Val 


Phe 


Phe 


He 


Glu 


Trp 


Thr 


Pro 


Tyr 


Asn 










245 










250 










255 




Val 


Ala 


He 


Leu 


He 


Ser 


Ser 


Tyr 


Gin 


Ser 


Leu 


Leu 


Phe 


Gly 


Asn 


Asn 








260 










265 










270 






Cys 


Glu 


Arg 


Ser 


Lys 


His 


Leu 


Asp 


Leu 


Val 


Met 


He 


Val 


Thr 


Glu 


Val 






275 










280 










285 








He 


Ala 


Tyr 


Ser 


His 


Cys 


Cys 


Met 


Asn 


Glu 


Val 


He 


Tyr 


Ala 


Phe 


Val 




290 










295 










300 










Gly 


Glu 


Arg 


Phe 


Arg 


Lys 


Tyr 


He 


Arg 


His 


Phe 


Phe 


His 


Arg 


His 


Leu 


305 










310 










315 










320 


Leu 


Met 


His 


Leu 


Gly 


Arg 


Tyr 


He 


Pro 


Phe 


Leu 


Pro 


Xaa 


Xaa 


Xaa 


He 










325 










330 










335 




Glu 


Arg 


He 


Ser 


Ser 


Val 


Ser 


Pro 


Ser 


Thr 


Ala 


Glu 


Pro 


Glu 


He 


Ser 



340 345 350 



lie Val Phe 
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(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



Met 


Glu 


Thr 


Pro 


Asn 


Thr 


Thr 


Glu 


Asp 


Tyr 


Asp 


Thr 


Thr 


Thr 


Glu 


Phe 


± 


















X u 














Asp 


Tyr 


Gly 


Asp 

o r\ 
U 


Ala 


Thr 


Pro 


Cys 


Gin 


Lys 


Val 


Asn 


Glu 


Arg 


Ala 


Phe 




Ala 


Gin 


Leu 


Leu 


Pro 


Pro 


Leu 


Tyr 


Ser 


Leu 


Val 


Phe 


Val 


He 


Gly 
















A n 
^ U 










ft D 














A O T"! 






V dx 


V ClX 


T .An 


V ClX 


U\Z Li 


Val 

V d X 


Gin 


Tvr 
xyx 


T iVc; 
ju y o 


"**x y 




bO 








bb 










D u 










■5 -Iff 


Lys 


Asn 


Met 


Thr 


Ser 


He 


Tyr 


Leu 


Leu 


Asn 


Leu 


Ala 


He 


Ser 


Asp 








/ U 










/ D 










O U 


lieu 


Leu 


Phe 


He 


Phe 


Thr 


Leu 


Pro 


Phe 


Trp 


He 


Asp 


Tyr 


Lys 


Leu 


Lys 


■5s 








ob 




















Q 

J D 






Asp 


Trp 


Val 


Phe 


Gly 


Asp 


Ala 


Met 


Cys 


Lys 


He 


He 


Ser 


Gly 


Phe 








X u u 




















X X u 








Tyr 


Thr 


Gly 


Leu 


Tyr 


Ser 


Glu 


He 


Phe 


Phe 


He 


He 


Leu 


Leu 


Thr 






±xb 










XZ U 










X z o 










Asp 


Arg 


Tyr 


Leu 


Ala 


He 


Val 


His 


Ala 


Val 


Phe 


Ala 


He 


Arg 


Ala 




130 










135 










140 












Thr 


Val 


Thr 


Phe 


Gly 


Val 


He 


Thr 


Ser 


He 


He 


He 


Trp 


Ala 


He 


Ms 










150 










155 










160 


Ala 


He 


He 


Ala 


Ser 
165 


Met 


Pro 


Gly 


Leu 


Tyr 
170 


Phe 


Ser 


Lys 


Thr 


Gin 
175 


Trp 


Glu 


Phe 


Thr 


His 
180 


His 


Thr 


Cys 


Ser 


Leu 
185 


His 


Phe 


Pro 


His 


Glu 
190 


Ser 


Leu 


Arg 


Glu 


Trp 
195 


Lys 


Leu 


Phe 


Gin 


Ala 
200 


Leu 


Lys 


Leu 


Asn 


Leu 
205 


Phe 


Gly 


Leu 


Val 


Leu 
210 


Pro 


Leu 


Leu 


Val 


Met 
215 


He 


He 


Cys 


Tyr 


He 
220 


Gly 


He 


He 


Lys 


He 


Leu 


Leu 


Arg 


Arg 


Pro 


Asn 


Glu 


Lys 


Lys 


Ser 


Lys 


Ala 


Val 


Arg 


Leu 


225 










230 










235 










240 


He 


Phe 


Val 


He 


Met 
245 


He 


He 


Phe 


Phe 


Leu 
250 


Phe 


Trp 


He 


Pro 


Tyr 
255 


Asn 


Leu 


Thr 


He 


He 
260 


He 


Ser 


Val 


Phe 


Gin 
265 


Asp 


Phe 


Leu 


Phe 


Thr 
270 


His 


Glu 


Cys 


Glu 


Gin 
275 


Ser 


Arg 


His 


Leu 


Asp 
280 


Leu 


Ala 


Val 


Gin 


Val 
285 


Thr 


Glu 


Val 


He 


Ala 
290 


Tyr 


Thr 


His 


Cys 


Cys 
295 


Val 


Asn 


Glu 


Val 


He 
300 


Tyr 


Ala 


Phe 


Val 
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Gly 


Glu 


Arg 


Phe 


305 








Ala 


Val 


His 


Leu 


Glu 


Arg 


Val 


Ser 








340 


Ala 


Gly 


Phe 








355 





Arg Lys Tyr lie 
310 

Val Lys Trp Leu 
325 

Ser Thr Ser Pro 



Arg Gin Leu Glu 
315 

Pro Phe Leu Ser 
330 

Ser Thr Gly Glu 
345 



His Arg Arg Val 
320 

Val Asp Arg lie 
335 

His Glu lie 'Ser 
350 



(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 360 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 





Asn 


Pro 


Thr 


Asp 


lie 


Ala 


Asp 


Thr 


Thr 


Leu 


Asp 


Glu 


Ser 


He 


Tyr 










5 










10 










15 






Asn 


Tyr 


Tyr 


Leu 


Tyr 


Glu 


Ser 


He 


Pro 


Lys 


Pro 


Cys 


Thr 


Lys 


Glu 








20 










25 










30 








He 


Lys 


Ala 


Phe 


Gly 


Glu 


Leu 


Phe 


Leu 


Pro 


Pro 


Leu 


Tyr 


Ser 


Leu 






35 










40 










45 










Glu 


Val 


Phe 


Gly 


Leu 


He 


Gly 


Asn 


Ser 


Val 


Val 


Val 


Leu 


Val 


Leu 




50 










55 










60 










pie 


Lys 


Tyr 


Lys 


Arg 


He 


Arg 


Ser 


Met 


Thr 


Asp 


Val 


Tyr 


Leu 


Leu 


Asn 












70 










75 










80 




Ala 


He 


Ser 


Asp 


Leu 


Leu 


Phe 


Val 


Phe 


Ser 


Leu 


Pro 


Phe 


Trp 


Gly 










85 










90 










95 




i^r 


Tyr 


Ala 


Ala 


Asp 


Gin 


Trp 


Val 


Phe 


Gly 


Leu 


Gly 


He 


Cys 


Lys 


Met 








100 










105 










110 






He 


Ser 


Trp 


Met 


Tyr 


Leu 


Val 


Gly 


Phe 


Tyr 


Ser 


Gly 


He 


Phe 


Phe 


Val 






115 










120 










125 








Met 


He 


Met 


Ser 


He 


Asp 


Arg 


Tyr 


Leu 


Ala 


He 


Val 


His 


Ala 


Val 


Glu 




130 










135 










140 










Xaa 


Xaa 


Xaa 


Ala 


Arg 


Thr 


He 


He 


Tyr 


Gly 


Val 


He 


Thr 


Ser 


Leu 


Ala 


145 










150 










155 










160 


Thr 


Trp 


Ser 


Val 


Ala 


Val 


Phe 


Ala 


Ser 


Leu 


Pro 


Gly 


Phe 


He 


Phe 


Ser 










165 










170 










175 




Thr 


Cys 


Tyr 


Thr 


Glu 


Arg 


Asn 


His 


Thr 


Tyr 


Cys 


Lys 


Thr 


Lys 


Tyr 


Ser 








180 










185 










190 






Leu 


Asn 


Ser 


Thr 


Thr 


Trp 


Lys 


Val 


Leu 


Ser 


Ser 


Leu 


Glu 


He 


Asn 


He 






195 










200 










205 








Leu 


Gly 


Leu 


Val 


He 


Pro 


Leu 


Gly 


He 


Met 


Leu 


Phe 


Cys 


Tyr 


Ser 


Met 




210 










215 










220 










He 


He 


Arg 


Thr 


Leu 


Gin 


His 


Cys 


Lys 


Asn 


Glu 


Lys 


Lys 


Asn 


Lys 


Ala 


225 










230 










235 










240 


Val 


Lys 


Met 


He 


Phe 


Ala 


Val 


Val 


Val 


Leu 


Phe 


Leu 


Gly 


Phe 


Trp 


Thr 



43 



Pro 


Tyr 


Asn 


He 








260 


lie 


Gin 


Asp 


Cys 






1^ r-T r— 

275 




Thr 


Glu 


Thr 


Leu 




290 






Phe 


Phe 


Leu 


Gly 


305 








Xaa 


Xaa 


Xaa 


Gly 


He 


Tyr 


Ser 


Ala 








340 


Asp 


His 


Asp 


Leu 



355 



245 








Val 


Leu 


Phe 


Leu 


Thr 


Phe 


Glu 


Arg 








280 


Ala 


Phe 


Val 


His 






295 




Glu 


Lys 


Phe 


Arg 




310 






Leu 


Phe 


Val 


He 


325 








Asp 


Thr 


Pro 


Ser 


His 


Asp 


Ala 


Leu 



360 





250 






Glu 


Thr 


Leu 


Val 


265 








Tyr 


Leu 


Asp 


Tyr 


Cys 


Cys 


Leu 


Asn 








300 


Lys 


Tyr 


He 


He 






315 




Cys 


Gin 


Tyr Cys 




330 






Ser 


Ser 


Tyr 


Thr 


345 













255 




Glu 


Leu 


Glu 


Val 




270 






Ala 


He 


Gin 


Ala 


285 








Pro 


He 


He 


Tyr 


Gin 


Leu 


Phe 


Lys 








320 


Gly 


Leu 


Leu 


Gin 






335 




Gin 


Ser 


Thr 


Met 




350 







(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
^ (A) LENGTH: 54 amino acids 

W (B) TYPE: amino acid 

10 (C) STRANDEDNESS : single 

Si (D) TOPOLOGY: linear 

r (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



'Pfir 


Cys 


Ser 


Ser 


His 
5 


Phe 


Pro 


Tyr 


Ser 


Gin 
10 


Tyr 


Gin 


Phe 


Trp 


Lys 
15 


Asn 


Me 


Gin 


Thr 


Leu 


Lys 


He 


Val 


He 


Leu 


Gly 


Leu 


Val 


Leu 


Pro 


Leu 


Leu 








20 










25 










30 






Ml 


Met 


Val 


He 


Cys 


Tyr 


Ser 


Gly 


He 


Leu 


Lys 


Thr 


Leu 


Leu 


Arg 


Cys 






35 










40 










45 








Arg 


Asn 


Glu 


Lys 


Lys 


Arg 























50 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

TTTCCATACA GTCAGTATCA ATTCTGGAAG AATTTCCAGA CATTAAAGAT AGTCATCTTG 60 

GGGCTGGTCC TGCCGCTGCT TGTCATGGTC ATCTGCTACT CGGGAATCCT AAAAACTCTG 120 

CTTCGGTGTC GAAATGAGAA GAAGAGG 147 



44 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Phe Pro Tyr lie Lys Asp Ser His Leu Gly Ala Gly Pro Ala Ala Ala 

15 10 15 

Cys His Gly His Leu Leu Leu Gly Asn Pro Lys Asn Ser Ala Ser Val 
20 25 30 

Ser Lys 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



n (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 
TibAGGATCC AAGATGGATT ATCAAGT 
t. (2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CTGATCTAGA GCCATGTGCA CAACTCT 

(2) INFORMATION FOR SEQ ID N0:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



45 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CCTGGCTGTC GTCCATGCTG ' 20 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CliARACTERISTICS : 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



CTGATCTAGA GCCATGTGCA CAACTCT 



27 



